BTT-Go: An Agent for Go that Uses a Transposition Table to Reduce the Simulations and the Supervision in the Monte-Carlo Tree Search
نویسندگان
چکیده
This paper presents BTT-Go: an agent for Go whose architecture is based on the well-known agent Fuego, that is, its search process for the best move is based on simulations of games performed by means of MonteCarlo Tree Search (MCTS). In Fuego, these simulations are guided by supervised heuristics called prior knowledge and play-out policy. In this context, the goal behind the BTT-Go proposal is to reduce the supervised character of Fuego, granting it more autonomy. To cope with this task, the BTT-Go counts on a Transposition Table (TT) whose role is not to waste the history of the nodes that have already been explored throughout the game. By this way, the agent proposed here reduces the supervised character of Fuego by replacing, whenever possible, the prior knowledge and the play-out policy with the information retrieved from the TT. Several evaluative tournaments involving BTT-Go and Fuego confirm that the former obtains satisfactory results in its purpose of attenuating the supervision in Fuego without losing its competitiveness, even in 19x19 game-boards.
منابع مشابه
A Monte Carlo-Based Search Strategy for Dimensionality Reduction in Performance Tuning Parameters
Redundant and irrelevant features in high dimensional data increase the complexity in underlying mathematical models. It is necessary to conduct pre-processing steps that search for the most relevant features in order to reduce the dimensionality of the data. This study made use of a meta-heuristic search approach which uses lightweight random simulations to balance between the exploitation of ...
متن کاملGo-Ahead: Improving Prior Knowledge Heuristics by Using Information Retrieved From Play Out Simulations
The proposal behind this paper is the introduction of a new agent denominated Go-Ahead: this is an automatic Go player that uses a new technique in order to improve the accuracy of the pre estimated values of the moves that are candidates to be introduced into the classical Monte Carlo tree search (MCTS) algorithm which is used by many of the current top agents for Go. Go-Ahead is built upon th...
متن کاملMonte Carlo Modeling of Dual and Triple Photon Energy Absorptiometry Technique
Introduction: Osteoporosis is a bone disease in which there is a reduction in the amount of bone mineral content leading to an increase in the risk of bone fractures. The affected individuals not only have to go through lots of pain and suffering but this disease also results in high economic costs to the society due to a large number of fractures. A timely and accurate diagnosis of this disea...
متن کاملThe Use of Monte-Carlo Simulations in Seismic Hazard Analysis in Tehran and Surrounding Areas
Probabilistic seismic hazard analysis is a technique for estimating the annual rate of exceedance of a specified ground motion at a site due to the known and suspected earthquake sources. A Monte-Carlo approach is utilized to estimate the seismic hazard at a site. This method uses numerous resampling of an earthquake catalog to construct synthetic catalogs to evaluate the ground motion hazard a...
متن کاملConvolutional Monte Carlo Rollouts in Go
In this work, we present a MCTS-based Go-playing program which uses convolutional networks in all parts. Our method performs MCTS in batches, explores the Monte Carlo search tree using Thompson sampling and a convolutional network, and evaluates convnet-based rollouts on the GPU. We achieve strong win rates against open source Go programs and attain competitive results against state of the art ...
متن کامل